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ABSTRACT 

We present the results of a novel new search of the first data-release from the Sloan Digital Sky Survey 
(SDSS-DRl) for the spectra of supernovae. The use of large spectroscopic galaxy surveys offers the 
prospect of obtaining improved estimates of the local supernova rate, with the added benefit of a very 
different selection function to that of conventional photometric surveys. In this Letter we present an 
overview of our search methodology and the details of 19 Type la supernovae found in SDSS-DRl. The 
supernovae sample is used to make a preliminary estimate, Fja = 0.4 ± Q.2h^ SNu, of the cosmological 
SNe rate. 

Subject headings: methods: data analysis — supernovae: general 



1. INTRODUCTION 

Supernovae (SNe) are generally discovered through dif- 
ference imaging techniques (see e.g. Perlmutter et al. 1995; 
Schmidt et al. 1998). However, it is also possible to detect 
them spectroscopically - by identifying the broad peaks 
and troughs that typically distinguish a SN spectrum from 
that of its host-galaxy. 

The spectroscopic approach to the detection of SNe has 
a number of advantages; namely that a detection can be 
made with only one observation, and that the type is read- 
ily identifiable without additional follow-up observations. 
Despite these advantages it is clear that a dedicated spec- 
troscopic SN survey is completely impractical - the rate 
of detection relative to telescope time is far too low. For- 
tunately, the data required for such a survey are already 
available from several large galaxy redshift surveys now ap- 
proaching completion. In a forthcoming paper, Mortlock, 
Madgwick & Hewett I2UU3I calculate that the 10^ galaxy 
spectra to be obtained by the Sloan Digital Sky Survey 
(SDSS, York et al. 2000) should yield - 200 SNe detec- 
tions. 

One immediate application for such a sample is that the 
cosmological SNe rate, F, can be constrained to < 10 per 
cent, a considerable improvement on existing single survey 
measurements of F (e.g. Pain et al. 11996. 2002. Hardin et 
al. 2002|). Moreover, using a galaxy redshift survey ensures 
that the host galaxy sample is extremely well defined, in- 
cluding redshifts, broad-band colours, spectroscopic and 
morphological types. 

The SNe presented in this Letter have typically been 
observed ~2-3 years previously, and so follow-up observa- 
tions are not possible. However, it is important to stress 
that if a search for SNe is carried out as soon as the spec- 
troscopic data in a given redshift survey is reduced there 
is potentially a significant amount of time in which more 
detailed follow-up observations can be carried out. 

This Letter describes the first results of an analysis of 
the ~ 10^ galaxy spectra in the SDSS Data Release 1 
(SDSS-DRl Abazajian et al. OnOHjl . The SNe detection 
procedures are described in §2, with the efficiency of the 
technique assessed using simulations in §3. The actual SNe 



discoveries are detailed in §4 before an initial estimate of 
the cosmological SN rate is presented in §5. The main 
results and future possibilities are summarised in §6. 

2. METHOD 

The spectroscopic detection of SNe relies upon the abil- 
ity to distinguish the broad spectral features typical of 
these objects in a composite host-galaxy-|-SN spectrum 
where the galaxy spectrum will usually dominate. Two 
key tasks need to be addressed in order to achieve this 
goal: 

1. The host-galaxy spectrum needs to be 'subtracted' 
from the observed spectrum, leaving a 'residual' 
spectrum. The residual spectrum should be domi- 
nated either, by noise (if the original spectrum was 
simply that of a galaxy), or, by SNe spectral features 
if these are present. 

2. The 'residual' spectrum then needs to be searched for 
the presence of characteristically broad SNe features, 
which must also be distinguished from any other ar- 
tifacts that have not been removed in the previous 
step. 

We outline here the steps we have adopted to carry out 
these tasks. 

2.1. Galaxy subtraction 

Although galaxy spectra appear superficially to exhibit 
a wide variety of forms, they can be very successfully pa- 
rameterised with only a handful of components. In partic- 
ular, the use of a principal component analysis (PCA) has 
proved to be well adapted to this task (see e.g. Connolly 
et al. Folkes, Lahav & Maddox IT^ Madgwick et 

al. 120023) • 

The key result of the PCA is the construction of an or- 
thogonal set of 'eigenspectra' - each of which successively 
incorporates the maximum amount of information about 
the observed galaxy spectral distribution. Previous analy- 
ses of samples of galaxy spectra have shown that as few as 
8 such eigenspectra can be used to accurately reconstruct 
most galaxy spectra (Folkes, Lahav & Maddox [l996|l . 
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In our analysis we perform PCA on a representative 
sample of 1000 galaxy spectra drawn from the SDSS, to 
construct a set of 20 eigenspectra. The eigenspectra pro- 
vide an optimal basis for reconstructing, and then sub- 
tracting, each host-galaxy spectrum. The extended wave- 
length range, excellent resolution and spectrophotometric 
qualities of the SDSS spectra mean that all 20 components 
contain information of significance for the spectroscopic re- 
constructions. However, our technique is not sensitive to 
the exact number of components employed and the use of 
only 10 eigenspectra yielded essentially identical results. 

The detailed procedure is as follows: 

1. The basis of 20 eigenspectra was constructed using 
PCA on 1000 galaxy spectra, chosen not to contain 
a strong signature of SNe spectra or any other spec- 
troscopic artifact or anomaly. 

2. Each observed spectrum in the SDSS~DR1 was then 
projected onto this basis. These projections quantify 
the relative importance of different components in 
the spectra of the galaxies and the observed spectra 
can be reconstructed very accurately from the 20 
eigenspectra. Unusual spectroscopic features (such 
as those of SNe) do not contribute to the form of the 
eigenspectra and are therefore not included in the 
reconstructed galaxy spectra. 

3. Each galaxy spectrum was then reconstructed from 
the eigenspectra. Subtracting the reconstructed 
spectrum from the original galaxy spectrum pro- 
duces a 'residual' spectrum. It is this residual spec- 
trum that will be searched for the SNe features. 

2.2. SNe matching 

The residual spectrum left after subtracting the recon- 
structed host-galaxy spectrum is almost always domi- 
nated by noise, validating the effectiveness of the PCA- 
reconstructions. However, spectroscopic artifacts were 
present for a small fraction of the objects due to poor sky 
subtraction or residual nebular emission lines, the latter re- 
sulting from particularly unusual emission line ratios. To 
reduce the impact of these relatively high-frequency resid- 
uals, a low-pass filtering of the residual spectra was ap- 
plied before attempting to identify SN features. The filter- 
ing was based upon the d trous wavelet transform (Starck, 
Siebenmorgen & Gredel 1997 and references therein), as 
described in Madgwick et al. 201)21) This additional filter- 
ing leaves a low-pass filtered residual spectrum, f\, allow- 
ing us to isolate just the broadest spectral features without 
contamination from narrow peaks and troughs present in 
the original observed spectrum. 

In order to quantify the significance of a SN detection 
we use two parameters - the correlation coefficient (here 
denoted by CC) and the RMS signal. The first of these 
parameters, CC, is determined simply by calculating the 
covariance between each low-pass filtered residual spec- 
trum, Ji {i = 0,.. iVgai) and a sequence of different SN 
templates, tj (j = 0, .. iVtcmp)\ 

^We make use of the observed spectra from SN1998aq at —9, 0, 
7, 19, 31 and 55 d ays (B ranch et al. 2003|. SN1998bu at -4, 8, 13, 
28 days (Jha e t al feggj and SN1999ee at -8, -1, 12, 20, 33 days 
(Hamuy et al. I2UU2I . as these cover all available spectral features 
uniformly over a wide range of times. 
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For each galaxy z, we set C^ — max(Cij), so that only 
the 'best-fitting' SN template is used to determine the 
likelihood of a given detection. The resulting value of CC 
quantifies the similarity between the shape of the residual 
spectrum and the shape of the best-fitting SN template. 

The RMS signal is also calculated from the low-pass fil- 
tered residual spectrum. This parameter provides a mea- 
sure of the amplitude of the signal present in the residual 
spectrum. 
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3. SIMULATIONS 



There are several criteria which determine whether a 
SN will be detectable in a given spectrum. One of the 
most important of these is the relative brightness of the SN 
compared with the host-galaxy - which will determine the 
prominence of the SN features in the observed composite 
host-galaxy -I- SN spectrum. 

In order to test how important these effects are we 
simulated several thousand composite 'host-galaxy-|-SN' 
spectra, by randomly choosing objects from the SDSS- 
DRl and adding our SNe templates. The flux in the 
galaxy spectra and SN templates was calculated over the 
wavelength range 5000 — 6OOOA, i.e. approximating the 
y-band, and SNe templates were added with fiux ratios 
(SN/galaxy) in the range 0.75-0.1. For each simulated 
spectrum we then ran our detection algorithm and calcu- 
lated both the CC and RMS values. It was determined 
from these simulations that our algorithms work very well 
at identifying a SN component in the spectra if the SN 
contributes > 10% of the F-band flux recorded in the 
SDSS fibre-spectrum (more details will be presented in 
Mortlock, Madgwick & Hewett 2003). 

An empirical cut in CC and RMS allows the detection 
of virtually all SNe present down to a relative flux ratio 
of 0.1 (corresponding to a magnitude difference Am ~ 2 
between the host-galaxy and SN). 

4. RESULTS 

We have calculated the CC and RMS parameters for all 
115,977 objects in SDSS-DRl with z < 0.25, Zconfidoncc > 
0.35 and a spectroscopic classification of 'galaxy'. 

Some 1500 SDSS-DRl spectra satisfied our selection cri- 
teria, representing ~ 1% of this spectroscopic galaxy sam- 
ple. However, although we inspected visually all of these 
spectra, high-confidence detections of SN were only made 
for objects which had CC > 0.6 - corresponding to only 
the 230 most significant detections. The detection algo- 
rithm has proved highly successful in identifying spectra 
with SNe present at high confidence. 

In total, 19 Type la SNe were identified with high con- 
fidence from our search. Another 15 possible detections 
were made, although further work is required both to 
quantify the true fraction of SNe in this sub-sample and 
to optimise the detection efficiency for the presence of SNe 
at low flux ratios. Properties of the high-confidence SNe 
detections are given in Tabled 
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Table 1 

Type Ia SNe discovered in SDSS DRl. In each case the SNe type, magnitude and ages have been derived using 
A wide variety of SNe templates (including 35 Type Ib/c and 41 Type Ia spectra). The quoted ages are the ±la 
range about the expected value. Also given are the flux ratios between the galaxy and SN components of 

EACH fibre-spectrum - AS DERIVED FROM THE SAME TEMPLATE LIBRARY. 



Name 




R.A. 




Dec 




VsN mag 


Flux ratio 




z 


Date observed 


SN ag 


e (days) 


2000fz 


01 


18 


35 


83 


+14 


41 


00 


5 


17.8 


0.40 





054 


2000-12-15 


6.3 


- 6.8 


2000ga 


02 


29 


03 


73 


-08 


24 


13 


7 


20.0 


0.14 





141 


2000-12-24 


6.0 


- 7.9 


2001kl 


03 


08 


51 


44 


-01 


10 


24 


1 


20.4 


0.06 





126 


2001-01-22 




20 


2001ks 


07 


56 


46 


50 


+36 


59 


17 





18.7 


0.12 





078 


2001-04-18 


-5.0 


-3.0 


2000fy 


08 


03 


12 


61 


+47 


36 


49 


7 


19.5 


0.19 





117 


2000-12-06 


2.5 


- 3.5 


2001kii 


08 


32 


09 


85 


+47 


17 


27 


6 


20.0 


0.17 





134 


2001-03-13 


-5.0 


-3.0 


2001km 


09 


11 


38 


38 


-00 


42 


54 





18.4 


0.56 





070 


2001-02-15 


6.0 


- 6.3 


2001ki 


09 


22 


29 


15 


+57 


54 


29 


3 


18.1 


0.30 





063 


2001-01-02 


10 


- 11 


2001kq 


09 


33 


34 


31 


+55 


10 


26 





18.5 


0.08 





074 


2001-03-23 


-4.9 


- -3.1 


2001kp 


09 


51 


53 


08 


+01 


06 


05 


8 


18.2 


0.34 





063 


2001-03-21 


-3.0 


--2.0 


2001kr 


09 


59 


15 


75 


+00 


58 


02 


4 


19.1 


0.43 





088 


2001-03-26 


11 


- 12 


2000fx 


10 


18 


00 


47 


-00 


01 


58 





18.7 


0.26 





065 


2000-12-01 


16 


- 18 


2001ki 


10 


48 


58 


40 


-00 


26 


45 


2 


19.3 


0.14 





106 


2001-01-01 


6.0 


- 7.3 


2001kk 


12 


47 


33 


41 


+00 


05 


57 


1 


18.8 


0.15 





086 


2001-01-19 


5.7 


- 7.8 


2001ku 


13 


39 


44 


77 


+62 


23 


37 





19.9 


0.15 





136 


2001-06-19 


3.3 


- 4.0 


2001ko 


14 


10 


58 


32 


+64 


50 


50 


9 


20.0 


0.15 





141 


2001-03-16 


7.6 


- 8.4 


2000fw 


14 


30 


14 


07 


+00 


30 


35 


4 


19.1 


0.28 





096 


2000-03-10 


-4.6 


- -3.3 


2001kt 


16 


17 


13 


40 


+48 


28 


27 


7 


19.6 


0.30 





104 


2001-05-25 


14 


- 17 


2000gb 


17 


32 


28 


54 


+56 


04 


25 


5 


20.4 


0.22 





123 


2000-10-01 


16 


- 20 
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Fig. 1. — Example spectra of the SNe identified in SDSS— DRl are shown in each panel. In addition, the lower plot in each panel shows 
the 'residual' spectrum that is left after subtracting the host-galaxy from the observed spectrum. The best-fitting template SN spectrum is 
shown overplotted on this residual spectrum. 
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5. THE COSMOLOGICAL SNE RATE 

A crude estimate of the number of SNe expected in the 
survey, (-/Vsn), is given by the product of four quantities: 
the number of galaxies in the survey, A'g; the average lu- 
minosity of a survey galaxy, (Lg); the typical control time 
for which a SN is sufficiently bright to be detectable, (^sn); 
and, of course, the SN rate itself (which is conventionally 
measured in supernova units, SNu^). Taking typical values 
for these quantities yields 



(iVsN) ^ 20 



iVg (£g) (tsN) r 

105 lOdayslSNu' 



(3) 



where ~ 10^°Lq (Madgwick et al. l2002a|l is the fidu- 
cial B-band luminosity of a field galaxy. The most difficult 
parameter to estimate is (^sn); which is determined from a 
combination of the SNe light-curve shapes, the positional 
distribution of the SNe in their host-galaxies, the size of 
the apertures used to obtain the spectra and the detection 
algorithm used. A value of (isN) — 20 days is consistent 
both with the best fit ages of the SNe listed in Tableland 
the more theoretical approach taken by Mortlock, Madg- 
wick & Hewett 20031 but is probably uncertain by up to 
50% in this preliminary analysis. 

The estimate of the SN rate from the discovery of 19 
type la SNe in the SDSS-DRl galaxy spectra is thus 



Tia = 0.4 ± 0.2h^ SNu, 



(4) 



where both the Poisson uncertainty in the sample size and 
our estimated uncertainty in the control time, (isN), are 
included, h is the value of the Hubble Constant in units of 
100 kms~^Mpc~^. Our estimate is consistent with previ- 
ous measurements at high redshifts: ria(2 = 0.4) = 0.8 ± 
0.5h^ SNu (Pain et al. E96), Ti^{z = 0.5) ~ 0.58 ± 0.2h^ 
SNu (Pain et al. 12002(1 . and is very similar to that previ- 
ously determi ned at low-^: ria(2; = 0.1) = 0.4±0.3/i^ SNu 
(Hardin et aL lMTUjl . 

6. CONCLUSIONS 

A novel spectroscopic analysis of the ~ 10^ galaxy spec- 
tra in the SDSS-DRl has resulted in the definite identifi- 
cation of 19 Type la SNe, along with a similar number of 
less certain detections. The resultant estimate of the local 
Type la SN rate, Fia ~ 0.4 ± 0.2h'^ SNu, is consistent with 
that obtained by more conventional imaging methods, al- 
though the uncertainties in all cases are dominated by the 
small number of SNe. 

The future for spectroscopic SNe searches is promising, 
as this methodology can be applied to any redshift survey 
for which a large sample of galaxy spectra exist, opening 
up new scientific applications for present and future sur- 
veys. A more complete analysis of the SDSS-DRl spectra 
will include a more sophisticated estimate of the Cosmo- 
logical SNe rate (Mortlock, Madgwick & Hewett 20Q3j), 
along with a number of consistency checks in which the 
magnitudes, redshifts and epochs (relative to maximum 
light) of the detected SNe are compared to the predicted 
distributions. 

Looking ahead, the ease with which these Type la SNe 
have been discovered in the SDSS-DRl spectra implies 

^1 SNu is defined as the number of SNe per century per 10^'' Solar 



that the full survey of ~ 10^ galaxies should yield at least 
~ 200 SNe of all types. Such a sample should provide 
an estimate of T which is largely free of systematic errors 
and subject to only a small statistical uncertainty. Fur- 
thermore, the spectroscopic selection applied to the SDSS 
sample will allow the frequency of SNe events in terms of 
the properties of the underlying galaxy population to be 
determined to unprecedented accuracy (see e.g. Cappel- 
laro et al. 1997, Cappellaro, Evans & Turatto 1993) . 

A final important point to make is that all of these SNe 
have been discovered several years after their occurrence, 
and as such follow-up observations are simply not possible. 
However, by incorporating our search methodology into 
the SDSS (and other survey's) data reduction pipelines 
these objects can be discovered essentially immediately. It 
is hoped that the presentation of the results in this Letter 
will result in this occurring, so that this interesting aspect 
of galaxy redshift surveys no longer goes neglected. 
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